Prosomarker: a prosodic analysis tool based on optimal pitch stylization and automatic syllabi fication
نویسندگان
چکیده
Prosodic research in recent years has been supported by a number of automatic analysis tools aimed at simplifying the work that is requested to study intonation. The need to analyze large amounts of data and to inspect phenomena that are often ambiguous and difficult to model makes the prosodic research area an ideal application field for computer based processing. One of the main challenges in this field is to model the complex relations occurring between the segmental level, mainly in terms of syllable nuclei and boundaries, and the supra-segmental level, mainly in terms of tonal movements. The goal of our contribution is to provide a tool for automatic annotation of prosodic data, the Prosomarker, designed to give a visual representation of both segmental and suprasegmental events. The representation is intended to be as generic as possible to let researchers analyze specific phenomena without being limited by assumptions introduced by the annotation itself. A perceptual account of the pitch curve is provided along with an automatic segmentation of the speech signal into syllable-like segments and the tool can be used both for data exploration, in semi-automatic mode, and to process large sets of data, in automatic mode.
منابع مشابه
APA: towards an Automatic Tool for Prosodic Analysis
In this paper a tool for the speech signal prosodic analysis is described. The system APA (Automatic Prosodic Analysis) is based on a tool for speech segmentation into syllabic units and on their description in terms of pitch, energy and duration. A particular linear stylization of the fundamental frequency function is proposed, which helps in describing efficiently intonation movements at phra...
متن کاملAutomatic pitch contour stylization using a model of tonal perception
A new quantitative model of tonal perception for continuous speech is described. The paper illustrates its ability for automatic stylization of pitch contours, with applications to prosodic analysis and speech synthesis in mind, and evaluates it in a perception experiment. After a discussion of the psychoacoustics of tonal perception and an overview of existing tonal perception models and syste...
متن کاملProsody Annotation for Unit Selection Tts Synthesis
This paper concerns prosody annotation and intonation modeling, especially for the application in a corpus based speech synthesis. In order to establish the rules of the automatic intonation modeling, a four hour fully annotated speech database has been acoustically and perceptually analyzed. The speech material included different text types, dialogs and prosodically rich phrases. As the result...
متن کاملProsody annotation for corpus based speech synthesis
The paper concerns prosody annotation especially for application in a corpus based speech synthesis. In order to establish the rules of automatic intonation modelling, phonetically labeled speech database of 4 hours has been perceptually and acoustically analyzed. The speech material included different text types and prosodically rich phrases. The annotation of the speech database consists in p...
متن کاملCoPaSul Manual - Contour-based parametric and superpositional intonation stylization
The purposes of the CoPaSul toolkit are (1) automatic prosodic annotation and (2) prosodic feature extraction from syllable to utterance level. CoPaSul stands for contour-based, parametric, superpositional intonation stylization. In this framework intonation is represented as a superposition of global and local contours that are described parametrically in terms of polynomial coefficients. On t...
متن کامل